Add GSoC 2026 introductory blog post for Soumyadip Niyogi#1929
Add GSoC 2026 introductory blog post for Soumyadip Niyogi#1929captainvogon wants to merge 2 commits into
Conversation
✅ Deploy Preview for earnest-hotteok-b1e1bf ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
There was a problem hiding this comment.
Pull request overview
Adds a new GSoC 2026 participant introductory blog post for Soumyadip Niyogi, describing the planned work to integrate CMS Combine tooling into the FCCAnalyses workflow.
Changes:
- Added a new GSoC 2026 blog post with Jekyll front matter and an introductory “About/Project” section.
- Documented project scope (datacard/workspace generation + Combine packaging into Key4hep) and community bonding activities.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
kjvbrt
left a comment
There was a problem hiding this comment.
Nice blog:) Welcome Soumyadip!
| | --- | --- | | ||
| | Name | [Soumyadip Niyogi](https://github.com/captainvogon) | | ||
| | Organisation | [IISER Thiruvananthapuram](https://www.iisertvm.ac.in/), [CERN](https://home.cern/), [HSF](https://hepsoftwarefoundation.org/) | | ||
| | Mentor | [Juraj Smiesko](https://github.com/kjvbrt) (CERN), [Jan Eysermans](https://github.com/jeyserma) (MIT) | |
| ## Introduction | ||
| Hi! I am Soumyadip Niyogi, a third-year BS-MS Physics student at IISER Thiruvananthapuram, India, and this summer I will be working with the HEP Software Foundation as a Google Summer of Code contributor at CERN. | ||
|
|
||
| My research so far has lived mostly in cosmology: MCMC-based constraints on modified gravity, Bayesian inference on Gamma-Ray Burst datasets, and simulating relativistic electrons in galaxy cluster magnetic fields. This project is my first serious step into collider physics, and I could not have asked for a better starting point. |
There was a problem hiding this comment.
Can you put some link which explains "MCMC-based constraints"
| My research so far has lived mostly in cosmology: MCMC-based constraints on modified gravity, Bayesian inference on Gamma-Ray Burst datasets, and simulating relativistic electrons in galaxy cluster magnetic fields. This project is my first serious step into collider physics, and I could not have asked for a better starting point. | ||
|
|
||
| ## The Project | ||
| The Future Circular Collider (FCC) programme at CERN uses FCCAnalyses as its primary analysis framework. Built on ROOT's RDataFrame, it efficiently processes simulated EDM4hep collision events into histograms. But the next step, turning those histograms into actual physics results like discovery significances or exclusion limits, requires CMS Combine. Combine is a separate statistical tool that FCC physicists currently have to interface with manually. |
There was a problem hiding this comment.
"EDM4hep collision" -> "EDM4hep based", can you provide EDM4hep link here?
"requires CMS Combine" -> "requires a statistical fitting tool, like CMS Combine" (just to express that CMS Combine is not the only option).
| ## The Project | ||
| The Future Circular Collider (FCC) programme at CERN uses FCCAnalyses as its primary analysis framework. Built on ROOT's RDataFrame, it efficiently processes simulated EDM4hep collision events into histograms. But the next step, turning those histograms into actual physics results like discovery significances or exclusion limits, requires CMS Combine. Combine is a separate statistical tool that FCC physicists currently have to interface with manually. | ||
|
|
||
| That manual step is what I am here to automate. By the end of the summer, the goal is to have a native Python interface inside FCCAnalyses that reads RDataFrame output histograms and automatically generates the datacards and RooFit workspaces that Combine needs. The first piece of work is packaging Combine itself inside the Key4hep software stack so it is available as a standard dependency, requiring no separate installation. |
| ## Community Bonding: Getting Up to Speed | ||
| I have been using the community bonding period to understand the landscape on both sides of the bridge I am building. | ||
|
|
||
| On the FCCAnalyses side, I have been working through the benchmark $Z(\mu\mu)H(bb)$ analysis tutorial. This involved configuring the histmaker, stacking signal and background processes with the plotting utilities, and migrating my workspace from AFS to EOS to handle larger ROOT files. Generating recoil mass plots and studying b-tagging score distributions has given me a concrete picture of exactly what data structures my integration code will need to consume. |
There was a problem hiding this comment.
"configuring the histmaker" -> "executing the histmaker style analysis".
Will this $Z(\mu\mu)H(bb)$ get properly rendered? (Probably a question for the repo maintainers).
|
|
||
| On the Combine side, I have been reading through the datacard format documentation and studying how RooFit workspaces encode shape systematics-the machinery that will form the core of the later project phases. I also had to tackle my first technical hurdle: setting up the Combine environment on CERN's `lxplus9` clusters without conflicting with the host's Key4hep stack. By isolating the environment using a Singularity container (`--cleanenv`) and building a local CMSSW workspace, I successfully compiled the tool and ran my first asymptotic limits check! | ||
|
|
||
| I will also be attending the group meeting with the FCCAnalyses team to introduce myself to everyone else, which should be highly beneficial for my future workflow and collaboration. |
There was a problem hiding this comment.
its meeting of the whole CERN EP-FCC group, not just FCCAnalyses :)
Added the GSoC 2026 Intro Blog for Soumyadip Niyogi (Integration of Combine with FCCAnalyses).